Introduce the dataset manifest and remove layer information from the partition table #11423

abey79 · 2025-10-03T15:53:59Z

What

Introduces gRPC endpoints and associated SDK method to access the dataset manifest table, which contains a row per layer. Also, remove most layer-related columns from the partition table.

This PR also attempts to solidify the notion that Scan{PartitionTable|DatasetManifest}Response is the One True Source(tm) of information on the returned dataframe's schema.

The OSS server does not yet implement the dataset manifest (RR-2482).

add new grpc endpoints to proto
update ext utilities
fix OSS server
add table provider
rename everything to DatasetManifest (from LayerTable)
update Python SDK API
~~update catalog?~~ Add dataset manifest table to the catalog provider #11444

github-actions · 2025-10-03T15:54:22Z

Web viewer failed to build.

Result	Commit	Link	Manifest
❌	`ff20d4e`	https://rerun.io/viewer/pr/11423	`+nightly` `+main`

^{Note: This comment is updated whenever you push a commit.}

Copilot

Pull Request Overview

This PR introduces the dataset manifest table functionality and removes layer-specific columns from the partition table. The dataset manifest provides layer-level metadata while the partition table now focuses solely on partition-level information.

Adds new gRPC endpoints for dataset manifest schema and scanning operations
Refactors partition table structure to remove layer information and add partition metadata
Implements dataset manifest provider for DataFusion integration

Reviewed Changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
rerun_py/src/catalog/dataset_entry.rs	Adds manifest() method to expose dataset manifest as DataFusion table
rerun_py/rerun_bindings/rerun_bindings.pyi	Python type hints for new manifest() method
crates/store/re_server/src/store.rs	Updates partition table schema removing layer columns and adding metadata
crates/store/re_server/src/rerun_cloud.rs	Implements placeholder gRPC handlers for dataset manifest endpoints
crates/store/re_redap_client/src/lib.rs	Adds error variant for dataset manifest schema operations
crates/store/re_redap_client/src/connection_client.rs	Implements client method for dataset manifest schema fetching
crates/store/re_protos/src/v1alpha1/rerun.cloud.v1alpha1.rs	Generated protobuf code for new dataset manifest endpoints
crates/store/re_protos/src/v1alpha1/rerun.cloud.v1alpha1.ext.rs	Schema definitions and helper methods for dataset manifest responses
crates/store/re_protos/proto/rerun/v1alpha1/cloud.proto	Protocol buffer definitions for dataset manifest endpoints
crates/store/re_datafusion/src/partition_table.rs	Adds TODO comment for deduplication
crates/store/re_datafusion/src/lib.rs	Exports new DatasetManifestProvider
crates/store/re_datafusion/src/dataset_manifest.rs	Implements DatasetManifestProvider for DataFusion integration

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

crates/store/re_protos/src/v1alpha1/rerun.cloud.v1alpha1.ext.rs

crates/store/re_datafusion/src/dataset_manifest.rs

Co-authored-by: Copilot <[email protected]>

abey79 changed the title ~~Add grpc endpoint for layer table and cleanup helper objects~~ Introduce the layer table and remove layer information from the partition table Oct 3, 2025

abey79 added sdk-python Python logging API include in changelog dataplatform Rerun Data Platform integration labels Oct 3, 2025

abey79 added 2 commits October 6, 2025 15:51

Add grpc endpoint for layer table and cleanup helper objects

6850986

add table provider for layer table

1cd8a69

abey79 force-pushed the antoine/layer-table branch from 701aed1 to 1cd8a69 Compare October 6, 2025 14:21

abey79 added 3 commits October 6, 2025 16:42

add DatasetEntry.layer_table to Python SDK

1ce0fc5

reintroduce storage_urls in partition table

33f7409

Fix schema mismatch

abc37b0

abey79 changed the title ~~Introduce the layer table and remove layer information from the partition table~~ Introduce the dataset manifest and remove layer information from the partition table Oct 7, 2025

abey79 added 2 commits October 7, 2025 09:21

Rename everything to "DatasetManifest"

feb1b31

Fix name + update proto docstring

b070cbe

abey79 requested a review from Copilot October 7, 2025 07:30

Copilot AI reviewed Oct 7, 2025

View reviewed changes

crates/store/re_protos/src/v1alpha1/rerun.cloud.v1alpha1.ext.rs Outdated Show resolved Hide resolved

crates/store/re_datafusion/src/dataset_manifest.rs Outdated Show resolved Hide resolved

crates/store/re_datafusion/src/dataset_manifest.rs Outdated Show resolved Hide resolved

abey79 and others added 9 commits October 7, 2025 09:34

Apply suggestion from @Copilot

5d43d14

Co-authored-by: Copilot <[email protected]>

Apply suggestion from @Copilot

6bba8ab

Co-authored-by: Copilot <[email protected]>

Minor fix

a82dbb4

Merge branch 'main' into antoine/layer-table

fcd9b23

Minor minor fix

648d46a

Add explicit fields() method

af5c3df

Add explicit xxx_inner_field() methods

8778501

Remove utterly deprecated constants

3e5baa0

More docstring and rename to LAYER_NAMES

ff20d4e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Introduce the dataset manifest and remove layer information from the partition table #11423

Introduce the dataset manifest and remove layer information from the partition table #11423

abey79 commented Oct 3, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 3, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Introduce the dataset manifest and remove layer information from the partition table #11423

Are you sure you want to change the base?

Introduce the dataset manifest and remove layer information from the partition table #11423

Conversation

abey79 commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related

What

Uh oh!

github-actions bot commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

abey79 commented Oct 3, 2025 •

edited

Loading

github-actions bot commented Oct 3, 2025 •

edited

Loading